Name | Version | Summary | date |
langsmith |
0.3.9 |
Client library to connect to the LangSmith LLM Tracing and Evaluation Platform. |
2025-02-21 01:56:42 |
python-lilypad |
0.0.16 |
An open-source prompt engineering framework. |
2025-02-19 20:10:48 |
agenta |
0.33.5 |
The SDK for agenta is an open-source LLMOps platform. |
2025-02-12 14:15:01 |
providentia |
2.4.0 |
Providentia is designed to allow on-the-fly, offline and interactive analysis of experiment outputs, with respect to processed observational data. |
2025-02-12 13:36:50 |
maihem |
1.7.3 |
LLM evaluations and synthetic data generation with the MAIHEM models |
2025-02-11 16:54:39 |
trust_eval |
0.1.5 |
Metric to measure RAG responses with inline citations |
2025-02-11 04:42:29 |
quotientai |
0.1.5 |
CLI for evaluating large language models with Quotient |
2025-02-10 23:35:25 |
dyff-schema |
0.24.2 |
Data models for the Dyff AI auditing platform. |
2025-02-10 23:10:21 |
dyff |
0.32.0 |
Meta-package to install the local SDK for the Dyff AI auditing platform. |
2025-02-10 18:43:19 |
judges |
0.0.6 |
A small library of research-backed LLM judges |
2025-02-07 21:05:29 |
pyevalai |
0.0.7 |
Automated python exercise evaluations with AI. |
2025-02-06 01:22:32 |
nuggetizer |
0.0.5 |
A package for Nuggetizer - a tool for information nugget creation and assignment to LLM-generated answers. |
2025-02-04 23:25:12 |
dyff-audit |
0.10.5 |
Audit tools for the Dyff AI auditing platform. |
2025-02-04 05:42:40 |
evo |
1.30.6 |
Python package for the evaluation of odometry and SLAM |
2025-02-02 16:01:02 |
corec |
1.0.6 |
A Context-Aware Recommendation Framework for Python |
2025-02-01 20:17:56 |
frechet-music-distance |
1.0.0 |
A library for computing Frechet Music Distance. |
2025-01-31 17:39:54 |
tieval |
0.1.8 |
A framework for evaluation and development of temporal-aware models. |
2025-01-29 10:01:54 |
AutoRAG |
0.3.13 |
Automatically Evaluate RAG pipelines with your own data. Find optimal structure for new RAG product. |
2025-01-25 05:28:38 |
dyff-client |
0.15.2 |
Python client for the Dyff AI auditing platform. |
2025-01-24 03:56:28 |
evalscope |
0.10.1 |
EvalScope: Lightweight LLMs Evaluation Framework |
2025-01-23 05:45:05 |